Multi-source TDOA estimation in reverberant audio using angular spectra and clustering
نویسندگان
چکیده
We consider the problem of estimating the time differences of arrival (TDOAs) of multiple sources from a two-channel reverberant audio signal. While several clustering-based or angular spectrum-based methods have been proposed in the literature, only relatively small-scale experimental evaluations restricted to either category of methods have been carried out so far. We design and conduct the first large-scale experimental evaluation of these methods and investigate a two-step procedure combining angular spectra and clustering. In addition, we introduce and evaluate five new TDOA estimation methods inspired from signal-to-noise-ratio (SNR) weighting and probabilistic multi-source modeling techniques that have been successful for anechoic TDOA estimation and audio source separation. The results show that clustering-based methods do not improve upon angular spectrum-based methods. For 5 cm microphone spacing, the best TDOA estimation performance is achieved by one of the proposed SNR-based angular spectrum methods. For larger spacing, a variant of the generalized cross-correlation with phase transform (GCC-PHAT) method performs best.
منابع مشابه
Multi-source localization in reverberant environments by ROOT-MUSIC and clustering
Localization of acoustic sources in reverberant environments by microphone arrays remains a challenging task in audio signal processing. As a matter of fact, most assumptions of commonly adopted models are not met in real applications. Moreover, in practical systems it is not convenient or possible to employ sophisticated and costly architectures, that require precise synchronization and fast d...
متن کاملReliability Measurement of Time Difference of Arrival Estimations for Multiple Sound Source Localization
Time Difference Of Arrival (TDOA) estimates are used for passive acoustic single sound source localization with microphone arrays. The technique of choice in most systems for TDOA estimation is the Generalized Cross-Correlation (GCC) method [1]. For a multi-sound source scenario, cross-correlation terms of the active sound source signals as well as noise and reverberation effects complicate the...
متن کاملConsidering the Second Peak in the Gcc Function for Multi-source Tdoa Estimation with a Microphone Array
Time Difference Of Arrival (TDOA) estimates can be used for passive acoustic multiple sound source localization with microphone arrays. The TDOA estimation is based on the cross-correlation function of two signals of a microphone pair. Existing systems assume only one dominant sound source per analysis frame rejecting the localization information of other active sources present in the acoustic ...
متن کاملBlind Source Separation with Distributed Microphone Pairs Using Permutation Correction by Intra-Pair TDOA Clustering
In this paper, we present a novel framework of distributed microphone array for blind source separation (BSS), where stereo microphones or proximately-placed microphone pairs are distributed. Unlike distributing all microphones individually, the time difference of arrival (TDOA) in the paired channels can be robustly estimated without suffering spatial aliasing. Based on it, sound sources are s...
متن کاملWeighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source
We study the estimation of time difference of arrival (TDOA) under noisy and reverberant conditions. Conventional TDOA estimation methods such as MUltiple SIgnal Classification (MUSIC) are not robust to noise and reverberation due to the distortion in the spatial covariance matrix (SCM). To address this issue, this paper proposes a robust SCM estimation method, called weighted SCM (WSCM). In th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Signal Processing
دوره 92 شماره
صفحات -
تاریخ انتشار 2012